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Abstract — In this paper, we consider tlie problem of state 
estimation tlirougli observations possibly corrupted with both 
bad data and additive observation noises. A mixed £i and £2 
convex programming is used to separate both sparse bad data 
and additive noises from the observations. Through using the 
almost Euclidean property for a linear subspace, we derive a 
new performance bound for the state estimation error under 
sparse bad data and additive observation noises. Our main 
contribution is to provide sharp bounds on the almost Euclidean 
property of a linear subspace, using the "escape-through-a-mesh" 
theorem from geometric functional analysis. We also propose and 
numerically evaluate an iterative convex programming approach 
to performing bad data detections in nonlinear electrical power 
networks problems. 

I. Introduction 

In this paper, we study the problem of state estimation 
under both bad data and observation noise. In state estimation 
problems, the observations may be corrupted with abnormally 
large measurement errors, called bad data, in addition to the 
usual additive observation noise. More specifically, suppose we 
want to estate the state x described by an m-dimensional real- 
numbered vector, and we make n measurements, then these 
measurements can be written as an rt-dimensional vector y, 
which is related to the state vector through the measurement 
equation 

y /i(x) + V + e, (I.l) 

where /i(x) is a nonlinear function relating the measurement 
vector to the state vector, and v is the vector of measurement 
noise, and e is the vector of bad data imposed on the 
measurement. In this paper, we assume that v is an m- 
dimensional vector with i.i.d. zero mean Gaussian elements 
of variance cr^. We also assume that e is a vector with only 
k nonzero entries, and the nonzero entries can take arbitrary 
real-numbered values, reflecting the nature of bad data. 

It is well known that Least Square (LS) method can be used 
to suppress the effect of observation noise on state estimations. 
In LS method, we try to find a vector x minimizing 



Mx)||. 



(1.2) 



However, the LS method generally only works well when there 
are no bad data e corrupting the observation y. 

In this paper, a mixed least £1 norm and least square convex 
programming is used to simultaneously detect bad data and 
subtract additive noises from the observations. In our theoret- 
ical analysis of the decoding performance, we assume h{x.) is 
a linear transformation with H as an n x m matrix with 
i.i.d. standard zero mean Gaussian entries. Through using the 
almost Euclidean property for the linear subspace generated by 



H, we derive a new performance bound for the state estimation 
error under sparse bad data and additive observation noises . In 
our analysis, using the "escape-through-a-mesh" theorem from 
geometric functional analysis L5|, we are able to significantly 
improve on the bounds for the almost Euclidean property 
of a linear subspace, which may be interesting in a more 
general mathematical setting. Compared with earlier analysis 
on the same optimization problem in [11], the analysis is new 
using the almost Euclidean property rather than the restricted 
isometry conditions used in and we are able to give 

explicit bounds on the error performance, which is sharper 
than the analysis using the restricted isometry conditions in 

Ell. 

Inspired by bad data detection methods for linear systems, 
we further propose an iterative convex programming approach 
to perform combined bad data detection and denoising in 
nonlinear electrical power networks. The static state of an 
electric power network can be described by the vector of bus 
voltage magnitudes and angles in power networks. However, 
in smart grid power networks, the measurement of these 
quantities can be corrupted due to errors in the sensors, com- 
munication errors in transmitting the measurement results, and 
adversarial compromises of the meters. So the state estimation 
of power networks needs to detect, identify, and eliminate 
large measurement errors [1], [2], [3 |. Since the probability 
of large measurement errors occurring is very small, it is 
reasonable to assume that bad data are only present in a small 
fraction of the available meter measurements results. So bad 
data detection in power networks can be viewed as a sparse 
error detection problem, which shares similar mathematical 
structures as sparse recoveries problem in compressive sensing 
El, ifTTl . However, this problem in power networks has several 
unique properties when compared with ordinary sparse error 
detection problem [4|. In fact, h{x) in (11.11) is a nonlinear 
mapping instead of a linear mapping in lITTI . Our iterative 
convex programming based algorithms work is shown by 
numerical examples working well in this nonlinear setting. 
Compared with IIT2I . which proposed to apply £1 minimization 
in bad data detection in power networks, our approach offers 
a better decoding error performance when both bad data and 
additive observation noises are present. llT3llfT4l considered 
state estimations under malicious data attacks, and formulated 
the problem of state estimation under malicious attacks as a 
hypothesis testing problem by assuming a prior probability 
distribution on the state x. In contrast, our approach does not 
rely on any prior information on the signal x itself, and the 
performance bounds hold for arbitrary state x. 

The rest of this paper is organized as follows. In Section 



mi we introduce the convex programming to perform joint 
bad data detection and denoising, and derive the performance 
bound on the decoding error based on the almost Euclidean 
property of linear subspaces. In Section [III] a sharp bound on 
the almost Euclidean property is given through the "escape- 
through-mesh" theorem. In Section HV] we will present explicit 
bounds on the decoding error. In Section |V] we introduce our 
algorithm to perform bad data detection in nonlinear systems, 
and present simulation results of its performance in power 
networks. 

II. Bad Data Detection for Linear Systems 

In this section, we will introduce a convex programming 
formulation to do bad data detection in a linear systems, and 
give a characterization of its decoding error performance. In a 
linear system, the nxl observation vector is y = Hx + e + v, 
where x is the m x 1 signal vector (m < n), e is a sparse 
error vector with k nonzero elements, v is a noise vector with 
||v||2 < e. In what follows, we denote the part of any vector 
w over any index set K' as wk'- 

We solve the following optimization problem involving 
optimization variables x* and z, and we then estimate the 
state X to be x, which is the optimizing value for x* . 



min 1 1 y ^ Hx — z 1 1 1 , 

X* .z 



subject to 



|z||2 < £• 



(II. 1) 



We are now ready to give the main theorem which bounds the 
decoding error performance of (III.lt . 

Theorem 2.1: Let y, H, x, e and v are specified as above. 
Suppose that the minimum nonzero singular value of H is 
(Tmin- Let C be a real number larger than 1, and suppose that 
every vector w in the subspace generated by the matrix H 
satisfies C||wa'||i < for any subset K C {l,2,...,n} 

with cardinality \K\ < k, where k is an integer, and K = 
{1, 2, n} \ K. We also assume the subspace generated by 
H satisfies the almost Euclidean property for a constant a < 1, 
namely 

Q!v^|jw||2 < ||w||i 



holds for every w in the subspace generated by H 
Then the solution x satisfies 

2{C + 1) 



X 2 



< 



(Tmma{C - 1) 



(II.2) 



Proof: Suppose that one optimal solution set to dll.ll ) is 
(x,z). Since ||z||2 < e, we have ||z||i < \/ri||z||2 < \fn€. 

Since x* = x and z = v is a feasible solution for jlLll i. 
then 

l|y-^x-z||i 

= ||if(x-x)+e + v-z|li 
< ||i?(x-x)+e + v-v||i 
= l|e|li. 

Applying the triangle inequality to — x) + e + v — z||i. 



we further obtain 

||ff(x-x)+e||i-||v||i-||z||i < ||e||i. 

Denoting H{x — x) as w, because e is supported on a set 
K with cardinality \K\ < k,hy the triangle inequality for £i 
norm again, 

||e||i - llwA'lli + ||w;^||i - i|v||i - i|z||i < ||e||i. 
So we have 

- llwA'lli + ||w-^||i < l|z||i + j|v||i < 2V^e (n.3) 
With C||wa'||i < we know 



C- 1, 



|w||i < -IIwaIIi + ||w7^ 



c + v 

Combining this with ( III. 31 ). we obtain 

C-1m „ ^ r- 
w||i < 2\Jnt. 



C + 1 



By the almost Euclidean property aY^||w||2 < ||w||i, it 
follows: 

llwlb < (II.4) 
By the definition of singular values, 

fT™„||x-x||2 < ||F(X-X)||2 = ||w||2, (II.5) 

SO combining ( III.4I ). we get 

II "II ^ 2(C+1) 
||x-x||2 < — -e. 

■ 

Note that when there are no sparse errors present, the 
decoding error bound satisfies ||x — x||2 < ^— e. Theorem 
12.11 shows that the decoding error of (III.lt is oblivious to 
the presence of bad data, no matter how large in amplitude 
these bad data can be. This phenomenon also observed in 
|1 U by using the restricted isometry condition for compressive 
sensing. 

We remark that, for given y and given e, by strong lagrange 
duality theory, the solution x to dll.ll i will correspond to 
the solution to x in the following problem ( III.6t for some 
Lagrange duality variable A > 0. As e > increases, the 
corresponding A that produces the same solution to x will 
correspondingly decrease. 



mm 

x.z 



Ux- z||i + Allzl 



(11.6) 



In fact, when A 



11.61 ) approaches 

n ||y--ffx||i, 



and when A — > 0, (III. 61 ) approaches 



UxW 



Thus, (III.6t can be viewed as a weighed version of l\ 
minimization and li minimization (or equivalently the LS 
method). We will later use numerical experiments to show 



that in order to recover a sparse vector from measurements 
with both noise and errors, this weighted version outperforms 
both £i minimization and the LS method. 

In the next two sections, we will aim at explicitly computing 



2(C+1) 

CTmina(C-l) 



^/n, which will denote zu later in this paper The 
appearance of the y/n factor is to compensate for the energy 
scaling of large random matrices and its meaning will be clear 
in later context. Next, we will compute explicitly the almost 
Euclidean property constant a. 

III. Bounding the Almost Euclidean Property 

In this section, we would like to give a quantitative bound 
on the almost Euclidean property constant a such that with 
high probability (with respect to the measure for the subspace 
generated by the random H), Q!y^||w||2 < ||w||i holds for 
every vector w from the subspace generated by H. Here we 
assume that each element of H is generated from the standard 
Gaussian distribution A^(0, 1). So the subspace generated by 
H is a uniformly distributed {n — m)-dimensional subspaces 
from the high dimensional geometry. 

To ensure that the subspace generated from H satisfies the 
almost Euclidean property with a > 0, we must have the 
event that the subspace generated by H does not intersect 
the set {w e 5""^|||w||i < ai/n||w||2}, where 5"'"^ is the 
Euclidean sphere in i?". To evaluate the probability that this 
event happens, we will need the following "escape-through- 
mesh" theorem. 

Theorem 3.1: IS] Let 5* be a subset of the unit Euclidean 
sphere S""^^ in i?". Let F be a random m-dimensional 
subspace of i?", distributed uniformly in the Grassmanian with 
respect to the Haar measure. Let w{S) — E{sup^^g{h^w)), 
where h is a random column vector in i?" with i.i.d. iV(0, 1) 
components. Assume that w{S) < {y/n — m— ^-^==). Then 



PiYf]S 



> 1 -3.5e- 



From Theorem 13. II we can use the following programming 
to get an estimate of the upper bound of w{h, S). Because the 
set {w G 5"^"'^|||w||i < q:y^||w||2} is symmetric, without 
loss of generality, we assume that the elements of h follow 
i.i.d. half-normal distributions, namely the distribution for the 
absolute value of a standard zero mean Gaussian random 
variables. With hi denoting the i-th element of h, this is 
equivalent to 



n 



max 



E 

i=l 



subject to 



hiUi 

Ho > 0,1 < i < n 



Ui < ay/n 



(iii.i) 

(III.2) 
(III.3) 

(III.4) 



Following the method from |7|, we use the Lagrange duality 



to find an upper bound for the objective function of (IIII.ll l. 

n 

min maxh^'^w — uif >^ wf ~ I) (III. 5) 



Ki>0,n2>0,A>0 w 



1=1 1=1 



(III.6) 



where A is a vector (Ai, A2, A„). 

First, we maximize (IIII.6I 1 over Wi, i — 1,2, ...,n for fixed 
ui, U2 and A. By making the derivatives to be zero, the 
minimizing Wi is given by 

hi + Xi ~-U2 



2ui 

Plugging this back, we get 



1 < i < n 



h^w-ui(^m?-l) 

n n 

-U2(^ Wi - a^/n) + At 

i=l i=l 

_ Er=i (-"2 + A, + h,)^ 



Wi 



(111.7) 
(III.8) 



ui + OL\fnu2. (III. 9) 



Next, we minimize ( |III.9t over u\ > 0. It is not hard to see 
the minimizing ul is 

* _ VEr=i (-"2 + A, + h,r 



and the corresponding minimized value is 



. ^ (-U2 + Ai + htY + ay/nu2- 
\ i=i 



(III. 10) 



Then, we minimize ( IIII.IOI ) over A > 0. Given h and U2 > 
0, it is easy to see that the minimizing A is 



A. 



U2 - hi if hi < U2; 
otherwise, 



and the corresponding minimized value is 

/ E {u2 - hiY + ay/nu2. 



(IILll) 



l<i<n\hi<CU2 



Now if we take any U2 > 0, (IIII.l II ) serves as an upper 
bound for (IIII.6b . Since is a concave function, by Jensen's 
inequality, we have for any given U2 > 0, 



E{suv{h^w))< E{ J2 i^2-h,)^} + Ci\fnu2- 

y \<^<n■.h,<U2 

(III. 12) 

Since h has i.i.d. half-normal components, the righthand side 
of ( IIII.12I ) equals to 



{\j {u\ + l)erfc(M2/\/2) - V2AM2e-"^ + 0^2)%^, 

(III. 13) 

where erfc is the error function. 

One can check that ( IIII.l 31 ) is convex in U2- Given a, we 




with cardinality |/i | = A; < n, 

l|w|li ^' 
then /? must be a number satisfying 



Proof: Without loss of generality, we let |lw]|i = 1. Then 
by the Cauchy-Schwarz inequality. 
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|WA'||2 



'K\\2 



Fig. 1: a* over m/n 



" y/n — k 



minimize ( IIII.13I ) over M2 > and let g{a)y^^ denote the 
minimum value. Then from (IIII.12I ) and ( IIII.13I ) we know 



K n ~ k 

At the same time, by the almost Euclidean property. 



w{S) = £;(sup(h^w)) < g{a 



(III. 14) 



Given S = ^, we pick the largest a* such that g{a*) < so we must have 



Vl — S. Then as n goes to infinity, it holds that 

1 



w{S) < g{a*)y/n < {\/n ~ m - 



2v^ 



m 



(III. 15) 



Then from Theorem 13.11 with high probability ||w||i > 
a*\/n||w||2 holds for every vector w in the subspace gen- 
erated by H. We numerically calculate how a* changes over 
S and plot the curve in Fig. [T] For example, when S = 0.5, 
a* = 0.332, thus ||w||i > 0.332^||w||2 for all w in the 
subspace generated by H. 

Note that when ^ = i, we get a = 0.332. That is much 
larger than the known a used in ifTSll . which is approximately 
0.07 (see Equation (12) in |15l). When applied to the sparse 
recovery problem considered in [15], we will be able to 
recover any vector with no more than 0.0289n = 0.0578r7i 
nonzero elements, which are 20 times more than the -^^m 
bound in lITSI . 

IV. 



384' 



Evaluating the Robust Error Correction 
Bound 

If the elements in the measurement matrix H are i.i.d. as 
the unit real Gaussian random variables N{0, 1), following 
upon the work of Marchenko and Pastur |10|, Geman|8 | and 
Silverstein [9J proved that for m/n — S, as n ^ oo, the 
smallest nonzero singular value 



1 



1-Vs 



a^nllwlln < llwll?, 



, (1-/3)^ ^ 1 
k n — k ~ o?i 



Corollary 4.2: If a nonzero ?i-dimensional vector w satis- 
fies ||w||i > ay^ll'^^lh, and for any set K C {l,2,...,ri} 
with cardinality \K\ — k < n, if C\\wk\\i — for 
some number C > 1, then 



k jB + l-C^)- ^{B + 1 - C^Y - 4B 
n - 2B 

where B = L£+pl, 

Proof: If C||wA'||i > l|w-j^||i, we have 

I|waHIi 1 



(IV.l) 



C + 1' 



So by Theorem 14.11 /3 — tti-t satisfies 



|w||i 
1 

c+i 



k 



n — k 
This is equivalent to 
1 

< 



1 



Solving this inequality for -, we get ( IIV.II) . 



almost surely as n oo. 

Now that we have already explicitly bounded a and (Tmin, 
we now proceed to characterize C. It turns out that our 
earlier result on the almost Euclidean property can be used 
to computed C. 

Theorem 4.1: Suppose an n-dimensional vector w satisfies 
||w||i > a-y/n||w||2. Then if for some set K C {l,2,...,n} 



So for a sparsity this corollary can be used to find 
C such that = Qjp^- Combining these results on 

computing Cmin, a and C, we can then compute the bound 



2(C+1) 
cr„,i„a(C-l) 

n 

Fig. ID 



^Jn = nj in Theorem 12.11 For example, when 
■i, we plot the bound as a function of - in 
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a 

Fig. 3: A* versus a for Gaussian measurements 



V. Numerical Results 

Experiment 1: We first consider recovering a signal vector 
from Gaussian measurements. We generate the measurement 
matrix TJ"^™ with i.i.d. A^(0, 1) entries and a vector x' e i?™ 
with i.i.d Gaussian entries. Let x = x'/||x'||2 be the signal 
vector. Let m = 60 and n = 150. We first consider the recover 
performance when the number of erroneous measurements is 
fixed. We randomly choose twelve measurements and flip the 
signs of these measurements. For each measurement, we also 
independently add a Gaussian noise from N(0, a^). For a 
given (T, we apply (III.6I 1 to estimate x using A from to 
13, and pick the best A* with which the estimation error is 
minimized. For each a, the result is averaged over fifty runs. 
Fig. [3] shows the curve of A* against a. When the number 
of measurements with bad data is fixed. A* decreases as the 
noise level increases. 

We next fix the noise level and consider the estimation 
performance when the number of erroneous measurements 
changes. Each measurement has a Gaussian noise indepen- 
dently drawn from A^(0,0.5^). Let p denote the percentage 
of erroneous measurements. Given p, we randomly choose 
pn measurements, and each such measurement is added with 
a Gaussian error independently drawn from A^(0,5^). The 



estimation result is averaged over fifty runs. Fig. H]shows how 
the estimation error changes as p increases for different A. 
A = 8 has the best performance in this setup compared with 
a large value A = 15 and a small value A = 0.05. 

Experiment 2: We also consider estimating the state of the 
power system from available measurements and known system 
configuration. The state variables are the voltage magnitudes 
and the voltage angles at each bus. The measurements can 
be the real and reactive power injections at each bus, and the 
real and reactive power flows on the lines. All the measure- 
ments are corrupted with noise, and a small fraction of the 
measurements contains errors. We would like to estimate the 
state variables from the corrupted measurements. 

The relationship between the measurements and the state 
variables for a fc'-bus system can be stated as follows l,12J : 

k' 

p, = ^i;,£;jF„cos(% + 5, -5,), (v.i) 

fe' 

- Y,E^EjY,,sm{d,j+5^-6j), (V.2) 
i=i 

Pij = EiEjYij cos{9ij +6i- S-i) 

-EfY.j cos e,j + EfY,, cos 0,, i ^ j, (V.3) 

Qtj = E^EjY^jSm{9ij + Si - Sj) 

-EfY,, sin % + EfY,^ sin 0,, i ^ j, (V.4) 

where Pi and Qi are the real and reactive power injection at 
bus i respectively, Pij and Qij are the real and reactive power 
flow from bus i to bus j, Ei and 6i are the voltage magnitude 
and angle at bus i. Yij and 9ij are the magnitude and phase 
angle of admittance from bus i to bus j, Ysi and Ogi are the 
magnitude and angle of the shunt admittance of line at bus i. 
Given a power system, all Yij, 9ij, Ysi and 9 si are known. 

For a fc'-bus system, we treat one bus as the reference bus 
and set the voltage angle at the reference bus to be zero. There 
are TO = 2k' — 1 state variables with the first k' variables for 




the bus voltage magnitudes Ei and the rest k' — 1 variables 
for the bus voltage angles 6i. Let x € i?™ denote the state 
variables and let y £ i?" denote the n measurements of the 
real and reactive power injection and power flow. Let v e i?" 
denote the noise and e £ i?" denote the sparse error vector. 
Then we can write the equations in a compact form, 

y = /i(x)+v + e, (V.5) 

where h{-) denotes n nonlinear functions defined in dV. Il l to 

Na\ . 

An estimate of the state variables, x, can be obtained by 
solving the following minimization problem, 

min ||y- /i(x) -z||i + A||z||2, (V.6) 

x.z 

where x is the optimal solution x. A > is a fixed parameter. 
When A oo, (IV.6b approaches 

min ||y-/i(x)|li, (V.7) 

X 

and when A — ?> 0, (IV.6b approaches 

min ||y-;i(x)||2. (V.8) 

X 

Since h is nonlinear, we linearize the equations and apply 
an iterative procedure to obtain a solution. We start with the 
initial state x° where x^=l for alH € {1, n}, and x^^O 
for alH e {n + 1, 2n — 1}. In the fcth iteration, let Ay*^' = 
y-/i(x'^-i), then we solve the following convex optimization 
problem, 

min |lAy'=-FAx-z|li+A|lz|l2, (V.9) 

Ax,z 

where iJ"^™ is the Jacobian matrix of h evaluated at x'^'^^. 
Let Ax*^ denote the optimal solution Ax to (|V.91 l. then the 
state estimation is updated by 

x*-' ^x''^"^ + Ax''. (V.IO) 

We repeat the process until Ax*' 0. 

We evaluate the performance on the IEEE 30-bus test 
system. Fig. |5] shows the structure of the test system. Then the 
state vector contains fifty-nine variables. We take one hundred 



measurement including the real and reactive power injection 
at each bus and some of the real and reactive power flows on 
the lines. We first consider how the estimation performance 
changes as the noise level increases when the erroneous 
measurements are fixed. The errors of the measurements are 
simulated by inverting the sign of the real power injection at 
bus 2, bus 3, bus 5, bus 26 and bus 30, and inverting the sign of 
the reactive power injection at bus 30. Each measurement also 
contains a Gaussian noise independently drawn from iV(0, ct^). 
For a fixed noise level a, we solve ( IV.6I) by the iterative 
procedure using different A (from 0.5 to 12). The estimation 
performance is measured by ||x* — x||2, where x* is the true 
state variable and x is our estimation. For a fixed a, we choose 
the A* to be the one with which |jx* — x||2 is minimal among 
all the A's we consider. The result is averaged over fifty runs. 
Fig. |6] shows how A* changes as a increases from to 0.2. 
When the noise level is low, i.e. the measurements basically 
only contain errors, then the estimation performance is better 
when we use a larger A. When the noise level is high, a smaller 
A leads to a better performance. 

We also study how the estimation performance changes 
as the number of erroneous measurements increases. Each 
of the one hundred measurements contains random Gaussian 
noise independently drawn from A^(0, 0.05^). Let p denote the 
percentage of erroneous measurements with bad data. For fixed 
p, we randomly choose the set T of erroneous measurements 
with cardinality |T| = pm. Each erroneous measurement 
contains an additional Gaussian error independently drawn 
from iV(0, 0.7^). We than calculate the solution x of ( IV.6I ) and 
the estimation error ||x*— x||2. Fig.Qshows how the estimation 
error changes as p increases. The results are averaged over fifty 
runs. When A is small (A = 0.5), ( |V.6t approaches (|V.7| l, and 
the estimation error is relatively large if p is small, i.e. the 
measurements basically contain only noise. When A is large 
(A = 12), ( IV.6b approaches dV.Sb . and the estimation error 
is relatively large if p is large, i.e. the measurements contains 
errors besides noise. In contrast, if we choose A to be 7 in this 
case, the estimation error is relatively small for all p among 
the three choices of A. 




Fig. 7: The estimation error versus p 



VI. Conclusion 

In this paper, we study state estimation through observations 
corrupted with both bad data and additive observation noises. 
A mixed £i and £2 convex programming is used to separate 
both sparse bad data and additive noises from the observations. 
We used the almost Euclidean property of a linear subspace to 
provide sharp bounds on this convex programming based state 
estimation method. We also give sharp bounds for the almost 
Euclidean property of a linear subspace using the "escape- 
through-a-mesh" theorem from geometric functional analysis 
Is). We then propose an iterative convex programming based 
methods to perform state estimation with bad data detection in 
the nonlinear electrical power network problems. Simulation 
results confirm the effectiveness of the algorithms in denoising 
and detecting bad data at the same time. 

Acknowledgment 

The research is supported by NSF under CCF-0835706 and 
ONR under N00014-1 1-1-0131. 

References 

[1] Anjan Bose and Kevin Clements, "Real-Time Modeling of Power Net- 
works," Proceedings of IEEE, 75(12), 1607-1622 1987 

[2] A. Monticelli and A. Garcia, "Reliable bad data processing for real time 
state estimation," IEEE Trans. Power App. Syst., vol. PAS-102, pp. 1126- 
1139, May 1983. 

[3] BroussoUe, "State estimation in power systems: Detecting bad data 

through the sparse inverse matrix method," IEEE Trans. Power App. Syst., 

vol. PAS-94, pp. 329-337, Mar/Apr 1975. 
[4] Emmanuel Candes and Terence Tao, "Decoding by linear programming", 

IEEE Trans, on Information Theory, 51(12), pp. 4203 - 4215, December 

2005. 

[5] Y. Gordon, "On Milman's inequality and random subspaces with escape 

through a mesh in ij"". Geometric Aspect of Functional Analysis, Isr 

Semin. 1986-87, Led. Notes Math, 1317,1988. 
[6] M. Rudelson and R. Vershynin, "On sparse reconstruction from Fourier 

and Gaussian measurements". Comm. on Pure and Applied Math., 61(8), 

2007. 

[7] M. Stojn ic, "Various thresholds for -optimization in Compressed Sens- 
ing," http://arxiv. org/abs/0907 . 3666 

[8] S. Geman, "A limit theorem for the norm of random matrices," Annals 
of Probability, vol. 8, No. 2, pp.252-261, 1980 

[9] J. Silverstein, "The smallest eigenvalue of a large dimensional Wishart 
matrix," Annals of Probability, vol. 13, pp. 1364-1368, 1985 



[10] V. A. Marcenko and L. A. Pastur, "Distributions of eigenvalues for some 
sets of random matrices," Math. USSR-Sbomik, vol. 1, pp. 457-483, 1967. 

[11] E. Candes and P. Randall, "Highly robust error correction by convex 
programming." IEEE Transactions on Information Theory, vol. 54, pp. 
2829-2840. 

[12] W. Kotiuga and M. Vidyasagar, "Bad Data Rejection Properties of 
Weighted Least Absolute Value Techniques Applied to Static State 
Estimation," IEEE Trans. Power App. Syst., vol. PAS-101, pp. 844-853, 
Apr 1982. 

[13] O. Kosut, L. Jia, R. J. Thomas, and L. Tong, "On Malicious Data Attacks 
on Power System State Estimation," Proceedings of f/P£C,Cardiff, Wales, 
UK, 2010. 

[14] O. Kosut, L. Jia, R. J. Thomas, and L. Tong, "Malicious Data Attacks 
on Small Grid State Estimation: Attack Strategies and Countermeasures" 
Proceedisng of IEEE 2010 SmartGridComm, 2010. 

[15] Yin Zhang, "A Simple Proof for Recoverability of £1- 
Minimization: Go Over or Under?" Technical Report, 
|http://www.caam.rice.edu/~yzhang/reports/tr0509.pdf I 



